(19) 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



(12) 



(43) Date of publication: 

24.01.2001 Bulletin 2001/04 

(21) Application number: 99401812.5 

(22) Date of filing: 20.07.1999 



{ ii) EP 1 071 258 A1 

EUROPEAN PATENT APPLICATION 

(51) mtci 7: H04L 29/06, G06F 17/30 



(84) Designated Contracting States: 


(72) 


Inventor: Thomas, David R. 


AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 




06650 Opio (FR) 


MC NL PT SE 






Designated Extension States: 


(74) 


Representative: Potter, Julian Mark 


AL LT LV MK RO SI 




D. Young & Co., 






21 New Fetter Lane 


(71) Applicants: 




London EC4A 1 DA (GB) 


• TEXAS INSTRUMENTS INC. 






Dallas, Texas 75243 (US) 






Designated Contracting States: 






BE CH DE DK ES Fl GB GR IE IT LI LU MC NL AT 






CY 






• TEXAS INSTRUMENTS FRANCE 






06271 Villeneuve Loubet Cedex (FR) 




■ i 


Designated Contracting States: 






FR 







(54) User access monitoring in internet 

(57) A method for operating a computer system link- 
able to at least one of a plurality of addressable entities 
in a computer network is disclosed. The method com- 
prises deriving a value corresponding to a number of 
accesses to an address associated with a one of said 



plurality of entities, and automatically storing local to 
said computer system computer implementable instruc- 
tions and/or data comprising said one of a plurality of 
said entities for said value fulfilling a predetermined cri- 
terion. 
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Description 

[0001 ] The present invention relates to computer sys- 
tems. In particular, but not exclusively, to computer sys- 
tems operable within the Internet. s 
[0002] To enhance their functionality, and to share re- 
sources, for example printers, scanners, etc, individual 
computer systems are often linked together to form 
computer networks. Such networks may be limited to 
individual organisations, and even to discrete geograph- io 
ical locations for that organisation. For example, a par- 
ticular building or even floor within a building. However, 
some computer networks comprise computer systems 
which are geographically located a greater distance 
from each other, even for example in other countries, is 
Often, such large area networks are not limited to indi- 
vidual computer systems, but link computer networks, 
so that a large area network of computer networks is 
formed. 

[0003] A particularly important example of such a net- 20 
work of networks is the Internet. The Internet comprises 
a network of computer systems and computer system 
networks spanning substantially the whole of the devel- 
oped world. No one organisation owns or is responsible 
for the Internet as a whole. The computer systems and 25 
computer networks linked together by the Internet may 
be owned and operated by different organisations. Each 
computer system within the Internet has a unique ad- 
dress and can communicate with other computer sys- 
tems within the Internet by using the other system's ad- 30 
dress. The format for communication between computer 
systems on the Internet is defined by the Internet Pro- 
tocol (IP), in order to ensure that all the systems on the 
Internet can understand and communicate with each 
other. The telecommunication systems utilised by the In- 35 
ternet may comprise ISDN services, Plain Old Tele- 
phone Systems (POTS), mobile or wireless communi- 
cation systems such as cellular or satellite radio tele- 
phone systems, cable communication or any other suit- 
able telecommunications medium. 40 
[0004] A particularly useful piece of software for com- 
puter networks is something known as the World Wide 
Web (WWW). The WWW is most often used with the 
Internet, which fully utilises its functionality, but the soft- 
ware may also be used with other networks. 45 
[0005] The term WWW is also used to refer to the 
body of information embodied in different media such 
as text, audio or video, for example, and stored on com- 
puter systems within the computer network, for example 
computer systems forming part of the Internet. The in- so 
formation is divided up into suitable segments, typically 
relating to the nature of the information, for example, an 
image may be formed as a graphics file, and a document 
as a text file. Additionally, such information may also 
comprise so-called Web pages which are datafiles com- ss 
prising information, which may comprise different me- 
dia, and instructions on how to format that information 
when it is displayed on a computer screen. Within the 



WWW, not only do individual computer systems have 
addresses, but individual pieces of information. Thus, 
by setting the correct address, it is possible from one 
computer system to access not just another computer 
system but a particular piece of information stored on 
that computer system. In the following the term "entity" 
shall be used to refer to any addressable information 
within a network including individual computer systems, 
documents, websites and pages and parts thereof, text, 
audio and video files for example, unless the context dic- 
tates otherwise. In the terminology of the WWW, the ad- 
dress of an entity is known as a Uniform Resource Lo- 
cator (URL). 

[0006] The previously-mentioned entities (Web pag- 
es) are often accessed by users of the WWW, and often 
comprise electronic versions of newspapers or journals, 
for example. Often, the computer system at which a par- 
ticular entity resides is geographically distant from the 
computer system of a user who wishes to access that 
entity. Additionally, the telecommunications system over 
which such access must be made may be a particularly 
busy communication system. Consequently, it is often 
the case that accessing a desired entity and download- 
ing it to the user's computer system is a lengthy process. 
Not only is the time delay irritating to a user, but it may 
be costly since the user is likely to be paying local tele- 
phone company charges for or during that access time. 
The entity is typically downloaded and cached on the 
user's computer system i.e. (stored in the user's com- 
puter system local memory, e.g. internal hard disk 
drive), in order to release the telecommunications link 
and the entity server, and allow the user to then review 
the entity stored on their own computer system, for ex- 
ample on the hard drive. 

[0007] The delay and cost in accessing entities, over 
the Internet is inhibiting the development and use of the 
Internet, in particular by consumers. 
[0008] A known system which seeks to address the 
aforementioned problems comprises a Web TV pro- 
duced by NEC incorporating a NAVIO TV navigator pro- 
duced by NAVIO Communications Inc. In this known 
system, a user's computer system hard disk is updated 
with Web pages (entities) defined by so-called book- 
marks of the user at regular intervals. In this context 
"bookmarks" are a list of entity addresses stored in a 
user's computer system, and forming a part of a "drop- 
down" menu such that when the menu is activated the 
list of addresses is displayed and a user can "click" on 
an address to automatically access that entity. 
[0009] The system updates the hard disk drive at mid- 
night each day. However, if entities do not have a low 
activity at night, for example they are located in coun- 
tries in a different time zone where the time may be dur- 
ing the middle of the working day, the access time for 
the entitieses may be relatively long. Since the proposed 
system envisages accessing somewhere between 
5,000 to 10,000 entities (web pages) each time, it is ev- 
ident that significant time would be required and there- 



BNSDOCID: <EP 1071258A1_I_> 



2 



EP 1 071 258 A1 



fore costs incurred, for downloading the entities. Addi- 
tionally, the proposed system assumes that bookmarks 
accurately represent the computer system user's cur- 
rent requirements and desires. However, it is common 
that a user does not delete bookmarks when they are 
no longer interested in the particular topic, but retains 
them until such time as the file comprising the book- 
marks is of unreadable or unusable size and then re- 
moves unwanted bookmarks. Furthermore, the more 
successful or more common the system, the greater the 
problem regarding access times since one will have 
many systems within a particular time zone accessing 
or using the Internet at the same time. 
[0010] The embodiments in accordance with the 
present invention seek to ameliorate and address at 
least some of the aforementioned drawbacks and prob- 
lems. 

[0011] Particular and preferred aspects of the inven- 
tion are set out in the accompanying independent 
claims. Combinations of features from the dependent 
and/or independent claims may be combined as appro- 
priate and not merely as explicitly set out in the claims. 
[0012] An embodiment in accordance with a first as- 
pect of the invention provides a method for operating a 
computer system linkable to at least one of a plurality of 
addressable entities in a computer network. The method 
comprises deriving a value corresponding to a number 
of accesses to an address associated with a one of said 
plurality of entities, and automatically storing, local to 
said computer system, computer-implementable in- 
structions and/or data comprising said at least one of a 
plurality of addressable entities for said value fulfilling a 
predetermined criterion. 

[0013] An embodiment in accordance with a second 
aspect of the present invention provides apparatus op- 
erable to communicate with respective ones of a plural- 
ity of addressable entities within a computer network. 
The apparatus comprises communication means for 
communicating with a one of said plurality of entities, 
and means for deriving a value corresponding to a 
number of accesses to said one of a plurality of entities. 
There is further provided means for automatically stor- 
ing local to said apparatus computer-implementable in- 
structions and/or data corresponding to said one of a 
plurality of entities for said value fulfilling a predeter- 
mined criterion. 

[0014] Embodiments in accordance with the first and 
second aspects of the invention respectively provide a 
method of operating a computer system and apparatus, 
in which the frequency of access to particular entities 
may be monitored and the most popular, according to a 
user-defined criterion for example, are regularly cached 
to a memory local to the computer system, in order to 
provide substantially instant access to the relevant en- 
tity information next time the user accesses it. Thus : the 
user does not have to sit and wait for entity data to be 
downloaded over the computer network each time he 
accesses the entity, and therefore is able to save con- 



siderable amounts of time. 

[0015] In preferred. embodiments of the invention, ac- 
cess to one of the plurality of entities is monitored and 
a count value corresponding to an access to one of the 

5 plurality of entities is updated for each access. In this 
respect, it should be noted that if an access is made to 
a URL which is at a lower domain level than a pre-ex- 
isting URL, then the count value for the pre-existing URL 
is updated for the access to the lower domain level URL. 

10 Optionally, separate counts may be made for each spe- 
cific URL accessed by a user. An advantage of the first 
option is that it reduces the number of times access has 
to be made to an entity since the highest level domain 
information is stored. 

is [0016] Preferably, the periodicity of access to the en- 
tities is determined. Such periodicity is typically meas- 
ured in days, weeks or months, although sub-units such 
as hours may also be used if appropriate for the frequen- 
cy of access. Thus, it is possible to modify a count value 

20 to take into account the intervals at which a user access- 
es the particular entity. Thus, a count value for an entity 
which is accessed on a monthly basis, but still regularly, 
that is to say at least once a month, may be ascribed a 
count value which compares with a entity which is ac- 

25 cessed daily. This may be achieved by modifying the 
count value corresponding to an access to an entity by 
an amount determined by the periodicity of accesses. 
[0017] Typically, the count value is updated in count 
intervals of 1 , however, where the count value has been 

30 modified in accordance with the periodicity of access to 
the at least one of a plurality of entities, then the count 
interval for that entity will be greater than 1 . 
[0018] Suitably, a count value is decreased if the time 
period since a last access to the corresponding entity 

35 exceeds a predetermined time interval. Thus, the inven- 
tion automatically adjusts the count values for the enti- 
ties in accordance with the user's current access profile. 
Thus, entities which are no longer of interest to the user, 
that is to say are no longer being accessed, are gradu- 

40 ally removed from the user profile by virtue of their count 
value being systematically and automatically de- 
creased. 

[0019] The criterion for automatically storing the com- 
puter-implementable instructions and/or data compris- 
es ing an entity may be for the count value corresponding 
to that entity exceeding a predetermined threshold val- 
ue. Thus, any entities which are of sufficient interest to 
a user such that the number of accesses exceeds that 
threshold will be automatically stored. Optionally, the 
50 predetermined criterion may be that the count value is 
one of a group of highest count values, for example the 
10, 20 or 30 highest count values. In this way, the user 
access profile becomes a "top 10" etc of user-accessed 
entities. Such a process advantageously limits the 
55 number of entities being cached to the predetermined 
number, i.e. 10, 20, 30, etc, as desired. 
[0020] In a particular embodiment, a further or second 
count value is incremented corresponding to accesses 



BNSDOCID: <EP 1071258A1 J_> 



5 



EP 1 071 258 A1 



6 



to one of a plurality of entities. The first count value is 
then incremented for the second count value exceeding 
threshold value, and then the second count value is re- 
set to zero. Such an embodiment is a suitable way of 
providing a balance between accesses to entities of dif- 
ferent periodicities. For example, an entity whose peri- 
odicity is frequent, for example once a day, is likely to 
be provided with a second count value with a threshold 
relating to a typical month, i.e. 30. Thus, the first count 
value upon which the user profile is based, is only incre- 
mented every 30 accesses, by virtue of the second 
count value only initiating increment of the first count 
value for the second count value reaching the threshold. 
In this way, access on a daily basis can be balanced with 
access on a monthly basis, such that the user profile 
reflects the value of the entities and the consistency of 
their use, as opposed to the mere frequency of use, 
which might give mis-leading results. 
[0021] Suitably, the periodicity with which an entity is 
updated is deterrninedby inspecting the "last modified 
date" field in a sub-file for example, associated with the 
entity. Thus the frequency and/or optimum time for au- 
tomatically caching that entity to obtain the most up-to- 
date information may be determined. 
[0022] Embodiments in accordance with the invention 
may comprise a computer program comprising machine 
implementable instructions and/or data for operating a 
computer system in accordance with the methods de- 
scribed above. Such a computer program may be em- 
bodied on a carrier medium such as a magnetic disc, 
optical disc or semiconductor memory, for example. Op- 
tionally, the carrier medium may comprise a radio fre- 
quency carrier wave or an optical carrier wave or other 
carrier wave suitable for telecommunications, in order 
to provide remote distribution of a computer program 
embodying the invention. 

[0023] Specific embodiments in accordance with the 
present invention will now be described, by way of ex- 
ample only, and with reference to the drawings, in which: 

Figure 1 is a schematic illustration of a computer 
network comprising a server and client computer 
system; 

Figure 2 is a block diagram illustrating the compo- 
nents of a computer system of Figure 1 ; 
Figure 3 is a schematic illustration of a weighted 
stack in accordance with a first embodiment of the 
invention; 

Figure 4 is a schematic illustration of the weighted 
stack of Figure 3 after updating; 
Figure 5 is a schematic illustration of a data entry 
format of a weighted stack in accordance with a sec- 
ond embodiment of the invention; 
Figure 6 is a schematic illustration of a data entry 
found for a weighted stack in accordance with a 
third embodiment of the invention; 
Figure 7 illustrates a flowchart for a client module 
in accordance with a first aspect of the invention for 



creating an ordered stack of addresses; 

Figure 8 illustrates a flowchart for a client module 

operable for auto-caching; 

Figure 9 is a schematic illustration of a data entry 
s format in accordance with a second aspect of the 
invention; and 

Figure 10 illustrates a flowchart for a client module 
in accordance with a second aspect of the inven- 
tion. 

io 

[0024] Referring now to Figure 1 , there is illustrated a 
schematic representation of a network of computer sys- 
tems, such as the Internet, comprising a server compu- 
ter system 10 and client computer systems 11 . Both the 

75 server computer system 1 0 and the client computer sys- 
tem 1 1 comprise similar components, for example a sys- 
tem unit 12, a display device 18 with a display screen 
20, and user input devices, including a keyboard 22 and 
a mouse 24. A printer 21 is also connected to the sys- 

20 tern. Each system unit 12 comprises media drives, in- 
cluding an optical disk drive 14, a floppy disk drive 16 
and an internal hard disk drive not explicitly shown in 
Figure 1 . A CD-ROM 1 5 and a floppy disk 17 are also 
illustrated. Additionally, server computer system 10 

2S comprises high capacity storage media, such as further 
magnetic hard disk 1 9, for example. 
[0025] A program, comprising a sequence of compu- 
ter-implementable instructions and data, for implement- 
ing various functions or conveying various information 

30 may be supplied on media such as one or more CD- 
ROMs and/or floppy disk, then stored on a hard disk, for 
example. The computer system shown in Figure 1 is al- 
so connected 26 to a network 2, which in the illustrated 
embodiment is the Internet, but may be a local or wide 

3S area dedicated or private network, for example. A pro- 
gram or data implementable on a computer system may 
also be supplied on a telecommunications medium, for 
example over a telecommunications line by a network 
and/or the Internet. For a client computer system 11 op- 

40 erating as a mobile terminal over a radio telephone net- 
work, the telecommunications medium may be a radio 
frequency carrier wave carrying suitably encoded sig- 
nals representing the computer program and data or in- 
formation. Optionally, the carrier wave may be an optical 

45 carrier wave for an optical fibre link or any other suitable 
carrier medium, for a landline linked telecommunica- 
tions system. A mains electrical wiring system may also 
be utilised, for example, for caching information to a 
portable electronic device, such as a personal Digital 

so Assistant (PDA), when it is coupled to the mains for re- 
charging its batteries. 

[0026] Referring now to Figure 2, there is shown a 
"schematic and simplified representation of an exempla- 
ry implementation of a computer system such as that 
55 referred to with reference to Figure 1 . As shown in Fig- 
ure 2, a processor (CPU) 30 is coupled to a bus structure 
38. Also connected to the bus structure 38 are read only 
memory 32 and random access memory 34. A display 
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adaptor 36 connects a display device 18 to the bus 
structure 38. One or more user-input device adapters 
40 connect the user-input devices, including the key- 
board 22 and mouse 24 to the bus structure 38. An 
adapter 41 for the connection of the printer 21 may also 
be provided. One or more media drive adapters 42 can 
be provided tor connecting the media drives, for exam- 
ple the optical disk drive 1 4, the floppy disk drive 1 6 and 
hard disk drive 1 9, to the bus structure 38. One or more 
telecommunications adapters 44 can be provided for 
connecting the computer system to one or more net- 
works. The communications adapters 44 could include 
a local area network adapter, a modem and/or ISDN ter- 
minal adapter, etc as required. / 
[0027] It will be appreciated that Figures 1 and 2 are 
schematic representations of one possible implementa- 
tion of a computer system, suitable for either a server 
computer system 10 or a client computer system 11. It 
will be appreciated from the following description of em- 
bodiments of the present invention that the computer 
^ system in which the invention could be implemented 
may take many forms. For example, rather than the 
server computer system 1 0 comprising a display device 
18 and printer 21, it may be merely necessary for the 
server computer system 10 to comprise a processing 
unit, and be accessible by client computer systems 11. 
The client computer may also be a non-PC type of com- 
puter which is Internet or network compatible, for exam- 
ple a Web TV, or set-top box for a domestic TV capable 
of providing access to a computer network such as the 
Internet. Further applications for the client computermay 
be as an "electronic book" or "newspapaer", having suit- 
ably daily or weekly updates. Optionally, the client com- 
puter may be in the form of a wireless PDA or a multi- 
media terminal. 

[0028] Each computer system 10, 11 has a unique ad- 
dress, within the Internet, and in the terminology of the 
WWW, these addresses are known as Uniform Re- 
source Locators (URLs). Additionally, each entity within 
the WWW may also have a unique address or URL. An 
) entity may comprise many different types of information, 
for example text, graphics, audio, video, etc, and is 
therefore referred to as an hypermedia document or en- 
tity. Files comprising a single medium such as a graph- 
ics file, audio file, video file, etc may also be found on 
the WWW and are individually addressable. Within the 
WWW, hypermedia entities are created and handled us- 
ing a programming language called Hypertext Markup 
Language (HTML). HTML provides a method of format- 
ting entities in which the information may be separated 
from the presentation of the entity. That is to say entities 
containing the same information may be presented in 
different ways. 

[0029] WWW software is based on a client-server ar- 
chitecture. A Web client, for example a browser, is a 
computer program which can send requests for informa- 
tion, such as documents, to a Web server. A Web server 
is a program which, in response to a request from a cli- 



ent, sends enitites (documents), or error messages if 
appropriate, requested back to the requesting client. 
The Web server resides on server computer system 10. 
The entity received by the client is stored on the client 

5 computer system 1 1 , typically on the hard disk drive 1 9 
in a process known as "caching". 
[0030] The client program typically resides in the hard 
disk drive 19 of the client computer system 11, and is 
operable to configure client computer system 11 to in- 
to terface with the Internet and WWW 

[0031] In a first embodiment in accordance with the 
invention, the client comprises a software module com- 
prising computer implementable instructions for deter- 
mining a user access profile to WWW pages or sites. 

is The module utilises the user profile to automatically (au- 
to)-cache entities accessed by the user in accordance 
with the determined user access profile. Auto-cashing 
occurs at a suitable time as may be determined by the 
user, or automatically in accordance with the user ac- 

20 cess profile. 

[0032] The user profile is determined by monitoring 
access to entities initiated by the client. The first time a 
URL is accessed by the client, the address is stored to- 
gether with a count value which is initialised to 1 . Each 

25 time that address is accessed by the client, the count 
value is increased by 1 . Those entities having an access 
profile value greaterthan a threshold value, for example, 
are auto-cached by the client and stored on the client 
computer system hard disk drive 1 9 ready for almost in- 

30 stant access the next time the user enters one of the 
addresses corresponding to the auto-cached entities. 
The threshold value may optionally comprise the 
number of sites to be automatically-cached, such that 
only the top twenty most accessed sites, for example, 

35 are auto-cached. 

[0033] The entity addresses may be stored in a 
weighted stack 300 as shown in Figure 3. Each data en- 
try 302 of weighted stack 300 comprises two parts. A 
first part 304 for storing an entity address, and a second 

40 part 306 for storing the current access profile value. The 
stack is ordered in accordance with the access profile 
value 306, with the higher access profile values being 
stored at the top of the stack. Within the weighted stack 
300, there is a threshold 308. Addresses stored above 

45 this threshold 310 will be auto-cached, and addressed 
stored below the threshold 31 2 will not be auto-cached. 
[0034] Referring now to Figure 4, there is illustrated 
an example of the weighted stack 300 after an update 
subsequent to client access to entities. In the particular 

50 example illustrated in Figure 4, the client has accessed 
ADDRESS B twice since the situation reflected in Figure 
3. Consequently, ADDRESS B has now moved up a 
place in weighted stack 300, and ADDRESS D has been 
moved down a place since no further accesses have 

55 been made to it and its access count remains at 21. Ad- 
ditionally, ADDRESS C has been accessed three times 
since the time represented by the weighted stack illus- 
trated in Figure 3, such that its access profile value is 
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now 11. Not only has ADDRESS F not been accessed 
since the period reflected by the weighted stack illustrat- 
ed in Figure 3, but has also not been accessed for a pre- 
set period resulting in its access profile value being re- 
duced by 1 to 9. Furthermore, ADDRESS E has been 
accessed three times, thereby resulting in an access 
profile value of 10. Consequently, ADDRESS F has fall- 
en to two places below threshold 308 in weighted stack 
300. Meanwhile ADDRESS C is now just above thresh- 
old 308, whilst ADDRESS E is just below the threshold 
308. Thus, a user access profile is built up and recorded 
in the weighted stack 300 in accordance with the 
number of accesses made to entity addresses. 
[0035] The pre-set period after which no access has 
been made to a particular address such that the relevant 
access profile value is decreased may be any suitable 
period as determined by the user, for example one 
week, four weeks, six weeks, etc. Optionally, the anoth- 
er criterion may be used for determining decreasing the 
access profile value may be based on historical access 
trends, such that a value is given by 
Y*cache_access_frequency, where Y is a suitable 
number representing a time period or interval, for exam- 
ple a day, a week etc. If the value falls below a threshold 
value the the relevant access profile value is decreased. 
The cache_access_frequency is measured in the same 
time intervals. 

[0036] The threshold 308 may be based on the 
number of sites which are to be auto-cached, such that 
a certain number of storage locations are available 
above the threshold, for example 20. Optionally, thresh- 
old 308 may be based on a minimum number of access- 
es to an address, for example 20, as recorded by the 
access profile value. Addresses having an access pro- 
file value exceeding the threshold 308 will be auto- 
cached, whilst addresses not exceeding the threshold 
will not be auto-cached. 

[0037] Preferably, if an entity is accessed at an ad- 
dress which is an extension of an entity address that has 
already been weighted within the stack, then the pre- 
existing upper level address has its access profile value 
updated. 

[0038] Entities that are accessed often by a user will 
typically be periodically updated. Such updates may be 
weekly or some other suitable period such as fortnightly 
or monthly. 

[0039] Although entities updated at relatively long pe- 
riods such as weekly, fortnightly etc are not accessed 
frequently by a user, they are nevertheless accessed 
regularly and a user would wish to have prompt access 
to them without delay. Thus, it is desirable that such sites 
are pre-cached. A problem exists, however, in that be- 
cause. of their relatively infrequent access to such sites, 
the address of such sites is likely to fall well below the 
threshold 308 of a weighted stack 300 as illustrated in 
Figures 3 and 4. 

[0040] I n order to address this problem, a second em- 
bodiment in accordance with the invention provides for 



a user-definable or automatically derivable flag to be in- 
cluded in a data entry for the stack 300. An example of 
a suitable data entry format 502 is illustrated in Figure 
5. The data entry 502 comprises an address 504, and 
5 an access profile value storage area 506. The access 
profile value N is derived from the following relationship: 

N = nx flag, 

10 

where n is the number of actual accesses to the address 
stored in 504 and flag is a user-definable value, or au- 
tomatically derived value, depending upon the typical 
periodicity of access to the address stored in location 
is 504. For example, the falg may be automatically derived 
from the time interval since the last update of data for 
the entity. Such information may be obtained from "page 
info" entries for a webpage for example. 
[0041] A user may define the flag to be equal to 1 for 
20 an address which is typically accessed once a day, 

whilst a flag may be set to 7 for weekly access or 31 for j 
monthly access. In this manner, sites which are regularly 
accessed, but at relatively lengthy intervals, may still ap- 
pear above threshold value 308 and be auto-cached. 
25 [0042] Optionally, in accordance with a third embodi- 
ment of the invention, the data entry of a weighted stack 
may be formatted such that a flag defines a class of ad- 
dress having a respective threshold value for access be- 
fore the access value profile can be updated. An exam- 
30 pie of a data entry format 602 in accordance with the 
third embodiment is illustrated in Figure 6. Data entry 
602 comprises an address storage location 604 and an 
access profile value N storage location 606. A user-de- 
finable or automatically-definable flag is stored in loca- 
ls tion 608, and defines the class of the corresponding ad- 
dress stored in location 604. 

[0043] Data entry format 602 also comprises a stor- 
age area 610 for storing an access counter, n, which is 
incremented by 1 each time the address stored in loca- 

40 tion 604 is accessed. The flag stored in 608 defines a 

threshold for access counter n. Each time that threshold ) 
value is achieved profile counter N is updated by 1 , and 
access counter n is reset to zero. For example, a flag 
corresponding to an address which is likely to be ac- 

45 cessed daily may define a threshold value of 20, one 
that is updated weekly a threshold value of 5 and one 
that is updated monthly a threshold value of 1. 
[0044] The operation of a suitable client module for 
monitoring and sorting entity addresses into an ordered 

so sequence for determining which should be auto-cached 
and which should not, and for performing auto-caching, 
will now be described with reference to the flowcharts 
illustrated in Figure 7 and Figure 8. 
[0045] The operation of a computer system in accord- 

ss ance with the flowchart of Figure 7 starts at step 702 
and proceeds to step 704 where it is determined wheth- 
er or not an Internet site has been addressed. If no, then 
the process flows back to the start at step 702. If yes 
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then process control flows to step 706 where the ad- 
dress is compared with addresses waiting in stack 300 s 
and at step 70S it is determined whether or not an entry 
exists in stack 300 for that address. 
[0046] If no entry exists in the stack 300 then the proc- 
ess control flows to step 710 where the appropriate 
stack entry is created, and the step 71 2 the access pro- 
file value for the new stack is initialised. 
[0047] If an entry were to exist for that address then 
the process control flows to step 714 where the access 
profile value is updated for that address. Next, step 71 6, 
the address is awarded within the stack, and then the 
process ends at step 718. 

[0048] Turning now to Figure 8 there is illustrated a 
flow chart for the process of auto-caching contents of 
entities in accordance with a user access profile. Pref- 
erably the client module comprises software capable of 
configuring the client computer system to operate in ac- 
cordance with the flowchart of Figure 8. 
[0049] The computer system is configurable such that 
\ at a suitable time determined by the user or automati- 
cally in accordance with the user access profile, auto- 
caching is initiated, step 802. Next, step 804, the stack 
300 is interrogated to determine whether or not there 
are more entries left for auto-caching. If the result of step 
804 is no then process control flows to step 814 and the 
software module ceases operation. However, if the re- 
sult of step 804 is yes, then process control flows to step 
806 where the next entry in stack 806 is selected. At 
step 808, the entry is inspected to determine whether or 
not it meets the criterion, that is to say exceeds the 
threshold 308 or not. If not then process control flows to 
step 8418 where it stops. If the criterion is met then proc- 
ess control flows to step 810 where the entity address 
corresponding to the stack entry is accessed, and at 
step 81 2 the contents are stored locally to the computer 
system for example, on the local hard disk drive. Proc- 
ess control then flows to step 804. The process ends at 
step 814. 

[0050] In a preferred embodiment of the invention, the 
J default option for auto-caching entity addresses is daily. 
However, as discussed above, some sites are periodical 
entities, for example weekly or monthly journals and 
consequently are often updated once a week or once a 
month. Such updating typically occurring on the same 
day of each week or month. Thus, it would be preferable 
not to auto-cache such sites on a daily basis, or a basis 
more often than the sites are updated. 
[0051] Associated with a web page, are details 
termed "page info" in which is included the last dateAime 
on which the page was modified or updated. Addition- 
ally, each sub-file comprising a web page has an asso- 
ciated "last modified date" statement. The "page info" or 
"last modified date" statement may be utilised by a client 
module in accordance with a second aspect of the in- 
vention to identify the date/time on which the page or 
site was last updated. The client module may be config- 
ured to access the page info or last modified date state- 



ment in order to log it each time the address is accessed, 
and to determine the frequency at which modification or 
update of the entity/page occurs. The update or modifi- 
cation frequency can then be utilised to create an auto- 

5 cache frequency which may be included in a data entry 
such as one formatted as illustrated in Figure 9. The da- 
ta entry format 902 comprises an address storage loca- 
tion 904, a profile counter evaluation storage location 
906, which comprises the profile counter N, or any flags 

10 or access counters n in accordance with the second and 
third embodiments of the invention. Additionally, data 
entry format 902 comprises an auto-cache frequency 
storage location 908, in which the auto-cache frequency 
F is stored. The auto-cache frequency F will typically 

15 comprise the date and/or time for the next auto-caching 
of the corresponding address stored in location 904. 
[0052] Typically, the frequency F will be determined 
to be the next day, or an hour or so later than the deter- 
mined regular update derived from the page info or last 

20 modified date statement included in the entity/page. Op- 
tionally, a more suitable time for auto-caching the site/ 
page may be defined. 

[0053] The operation of a client module in accordance 
with a preferred embodiment of the invention for deriving 
25 an auto-cache frequency value F for an address will now 
be described with reference to the flowchart illustrated 
in Figure 10. 

[0054] The process for determining an auto-cache 
frequency value F begins at step 1002 of the flowchart 

30 illustrated at Figure 10. 

[0055] The process starts at step 1 002 when an entity 
(webpage for example) has been accessed by the client 
computer system. At step 1 004 the page information, or 
last modified date of the webpage is read, and the last 

35 modified date stored at step 1 006. Next, step 1008, the 
periodicity of the updates for the entity is determined, 
by analysing the most recent updated date, and a set of 
historic updated dates. 

[0056] The update periodicity may be calculated by 
40 determining the number of days since the entity was last 
modified and storing that number of days for a current 
access to the entity. Then the number of days obtained 
for the current update is subtracted from the number of 
days since last modification stored for a previous ac- 
45 cess. If the result is less than 1, then the periodicity is 
set to the most recently stored number of days since the 
last modification. 

[0057] Next, step 1010, the auto-cache interval or 
date defining the auto-cache frequency value F is set, 

50 and the process ends at step 1012. Thus, it is possible 
to determine at what intervals auto-caching should be 
carried out, and also if there is sufficient information re- 
garding the last modification, the time of day by which 
the auto-caching should be carried out in order to ensure 

55 the most up-to-date information has been cached. 

[0058] In view of the foregoing description it will be 
evident to a person skilled in the art that various modi- 
fications may be made within the scope of the invention. 
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[0059] The scope of the present disclosure includes 
any novel feature or combination of features disclosed 
therein either explicitly or implicitly or any generalisation 
thereof irrespective of whether or not it relates to the 
claimed invention or mitigates any or ail of the problems 
addressed by the present invention. The applicant here- 
by gives notice that new claims may be formulated to 
such features during the prosecution of this application 
or of any such further application derived therefrom. In 
particular, with reference to the appended claims, fea- 
tures from dependent claims may be combined with 
those of the independent claims and features from re- 
spective independent claims may be combined in any 
appropriate manner and not merely in the specific com- 
binations enumerated in the claims. 



Claims 

1 . A method for operating a computer system linkable 
to at least one of a plurality of addressable entities 
in a computer network, the method comprising: 

deriving a value corresponding to a number of 
accesses to an address associated with a one 
of said plurality of entities; and 
automatically storing local to said computer 
system computer implementable instructions 
and/or data comprising said one of a plurality 
of said entities for said value fulfilling a prede- 
termined criterion. 

2. A method according to Claim 1 , the step of deriving 
comprising: 

monitoring respective accesses to said one of 
said plurality of entities; and 
updating a count value corresponding to said 
access to said one of a plurality of entities. 

3. A method according to Claim 2 further comprising 
determining a periodicity of access to said one of a 
plurality of entities. 

4. A method according to Claim 3 said step of updating 
said count value comprising updating said count 
value by an amount depending on said periodicity 
of access to said one of a plurality of entities. 

5. A method according to Claim 4, said step of updat- 
ing said count value comprising updating said count 
value in count intervals greater than 1. 

6. A method according to any one of Claims 2 to 5, 
said step of updating comprising increasing said 
count value. 

7. A method according to any one of Claims 2 to 6, 



said step of updating further comprising decreasing 
said count value for a time period since a last ac- 
cess to said one of a plurality of entities exceeding 
a predetermined time interval. 

5 

8. A method according to any preceding Claim, said 
predetermined criterion comprising said count val- 
ue exceeding the threshold value. 

10 9. A method according to any one of claims 2 to 7, said 
predetermined criterion comprising said count val- 
ue being a one of a group of highest count values. 

10. A method according to any preceding Claim de- 
15 pendent on Claim 2, further comprising increment- 
ing a second count value corresponding to access- 
es to said one of a plurality of entities; and 

incrementing said value for said second count 
20 value exceeding a threshold value; and 

resetting said second count value to zero. 

11. A method according to any preceding Claim, further 
comprising determining a regular time at which said 

25 entity is ipdated. 

12. A method according to any preceding Claim, further 
comprising determining an update periodicity or 
time interval for said entity. 

30 

13. A computer program comprising machine imple- 
mentable instructions for performing a method ac- 
cording to any one of Claims 1 to 12. 

35 14. A carrier medium comprising a computer program 
according to Claim 1 3. 

15. A carrier medium according to Claim 14, comprising 
memory means including a magnetic disk memory, 

40 an optical disk memory or a semiconductor memo- 
ry. 

16. An article of manufacture comprising a carrier me- 
dium according to Claim 1 4 or Claim 1 5. 

45 

1 7. A carrier medium according to Claim 1 4 or 1 5, com- 
prising a radio frequency carrier wave or an optical 
carrier wave. 

50 18. Apparatus operable to communicate with respec- 
tive ones of a plurality of addressable entities within 
a computer network, comprising: 

communication means for communicating with 
55 at least one of said plurality of entities; 

means for deriving a value corresponding to a 
number of accesses to said at least one of a 
plurality of entities; and 
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means for automatically storing local to said ap- 
paratus information corresponding to said one 
of a plurality of entities for said value for filling 
a predetermined criterion. 

19. Apparatus according to Claim 18, further compris- 
ing: 

monitoring means for monitoring respective ac- 
cesses to said one of a plurality of entities; 
counter means; and 

means for updating said counter means corre- 
sponding to said access to said at least one of 
a plurality of entities. 

20. Apparatus according to Claim 19, further compris- 
ing means for determining a periodicity of access to 
said one of a plurality of entities. 

21. Apparatus according to Claim 20, wherein said 
means for updating said counter is operable to up- 
date said counter by an amount dependent on said 
periodicity of access to said one of a plurality of en- 
tities. 

22. Apparatus according to Claim 21, said updating 
means operable to update said counter in count in- 
tervals greater than 1. 

23. Apparatus according to any one of Claims 1 9 to 22, 
further comprising second counting means, said 
second counting means operable to increase a sec- 
ond counter value corresponding thereto for re- 
spective accesses to said one of a plurality of enti- 
ties; 

said updating means operable to increment 
said counter for said second counter value ex- 
ceeding a threshold; and 
said second counting means operable to reset 
said second counter value to zero for said sec- 
ond counter value exceeding said threshold. 

24. Apparatus according to any one of Claims 1 9 to 22, 
said updating means increasing said counter value 
for respective accesses to said one of a plurality of 
entities. 

25. Apparatus according to any one of Claims 1 9 to 24, 
said updating means operable to decrement said 
counter for a time period since a last access to said 
one of a plurality of entities exceeding a predeter- 
mined time interval. 

26. Apparatus according to any one of Claims 18 to 25, 
said predetermined criterion comprising said coun- 
ter value exceeding a threshold value. 
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27. Apparatus according to any one of Claims 18 to 25, 
said predetermined criterion comprising said coun- 
ter value being a one of a group of highest counter 
values. 
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